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A model-independent search for deviations from the Standard Model prediction is per- 
formed in e + p and e~p collisions at HERA II using all high energy data recorded by 
the HI experiment. This corresponds to a total integrated luminosity of 337 pb~ . All 
event topologies involving isolated electrons, photons, muons, neutrinos and jets with 
high transverse momenta are investigated in a single analysis. Events are assigned to 
exclusive classes according to their final state. A statistical algorithm is used to search 
for deviations from the Standard Model in distributions of the scalar sum of transverse 
momenta or invariant mass of final state particles and to quantify their significance. A 
good agreement with the Standard Model prediction is observed in most of the event 
classes. The most siginificant deviation is found in the fi-j-f channel in e + p collisions. 

1 Introduction 

At HERA electron^ and protons collide at a centre-of-mass energy of up to 319 GeV. These 
high-energy electron-proton interactions provide a testing ground for the Standard Model 
(SM) complementary to e + e~ and pp scattering. The approach presented here consists of a 
comprehensive and generic search for deviations from the SM prediction at large transverse 
momenta. The present analysis follows closely the strategy of the previous publication from 
the HI experiment [2]. All high Pt final state configurations involving electrons (e), muons 
(fi), jets (j), photons (7) or neutrinos (v) are systematically investigated. The complete 
HERA II data sample (2003-2007) is used, corresponding to a total integrated luminosity 
of 337 pb _1 shared between e + p (178 pb" 1 ) and e~p (159 pb _1 ) collisions. 

2 Data Analysis and Results 

All final states containing at least two objects (e, /j,, j, 7, v) with Pt > 20 GeV in the polar 
angle range 10° < 8 < 140° are investigated. All selected events are classified into exclusive 
event classes according to the number and types of objects detected in the final state (e.g. 
e ~h P"~3~ v i J-j-j-j-j)- The criteria used in the identification of each type of particle are 
chosen to ensure an unambiguous identification, while retaining high efficiencies |2|. All 
experimentally accessible combinations of objects have been studied and data events are 
found in 23 event classes. 

A precise and reliable estimate of all relevant processes present at high transverse mo- 
mentum in ep interactions is needed to ensure an unbiased comparison to the SM. Hence 
several Monte Carlo generators are used to generate a large number of events in all event 
classes, carefully avoiding double-counting of processes. The simulation contains the order 
a$ matrix elements for QCD processes, while second order a matrix elements are used to 
calculate QED processes. Additional jets are modelled using leading logarithmic parton 
showers as a representation of higher order QCD radiation. 

a In this paper "electrons" refers to both electrons and positrons, unless otherwise stated. 
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The event yields observed in each event class are presented and compared to the SM 
expectation in figures HJa) and (b) for e + p and e~p collisions, respectively. In each class, a 
good description of the number of observed data events by the SM prediction is seen. This 
demonstrates the good understanding of the detector response and of the SM processes in 
the considered phase space. Distributions of the scalar sum of transverse momenta ^ Pr of 
all objects are presented in figure [2] for e + p data. 
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Figurc 1: The data and the SM expectation for all event classes with observed data events 
or a SM expectation greater than one event. The results are presented separately for e + p 
(a) and e~p (b) collisions. 



3 Search for deviations 

In order to quantify the level of agreement between the data and the SM expectation and 
to identify regions of possible deviations, the same search algorithm as developed in [5] is 
used. All possible regions in the histograms of Pt and M a u distributions are considered. 
The number of data events (N b s ), the SM expectation (Nsm) an d its total systematic 
uncertainty (SNsm) are calculated for each region. A statistical estimator p is defined to 
judge which region is of most interest. This estimator is derived from the convolution of the 
Poisson probability density function (pdf) to account for statistical errors with a Gaussian 
pdf, G(b; Nsm, SNsm), with mean Nsm and width SNsm, to include the effect of non 
negligible systematic uncertainties [2] . The value of p gives an estimate of the probability of 
a fluctuation of the SM expectation upwards (downwards) to at least (at most) the observed 
number of data events in the region considered. The region of greatest interest (of greatest 
deviation) is the region having the smallest p- value, p m i n . 

The possibility that a fluctuation with a value p m i n occurs anywhere in the distribution 
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Figure 2: Distributions of ^2Pt for classes with at least one data event, for e + p data. The 
shaded areas show the regions of greatest deviation chosen by the search algorithm. 
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is estimated. This is achieved by creating hypothetical data histograms following the pdfs 
of the SM expectation. The algorithm is then run on those hypothetical histograms to find 
the region of greatest deviation and the corresponding p m j n is calculated. The probability 
P is then defined as the fraction of hypothetical data histograms with a p^ln ccma l to or 
smaller than the p m i n value obtained from the real data. P is a measure of the statistical 
significance of the deviation observed in the data. The event class of most interest for a 
search is thus the one with the smallest P value. Depending on the final state, a p mm -value 
of 5.7 • 10~ 7 ("5ct") corresponds to a value of — log 10 P, the negative decade logarithm of P, 
between 5 and 6. The overall degree of agreement with the SM can further be quantified 
by taking into account the large number of event classes studied in this analysis. Among 
all studied classes there is some chance that small P values occur. This probability can be 
calculated with MC experiments. A MC experiment is defined as a set of hypothetical data 
histograms following the SM expectation with an integrated luminosity equal to the amount 
of data recorded. The complete search algorithm and statistical analysis are applied to the 
MC experiments analgously as to the data. This procedure is repeated many times. The 
expectation for the P values observed in the data is then given by the distribution of P SM 
values obtained from all MC experiments. The probability to find in the MC experiments 
a P value smaller than in the data can be calculated and gives us the global significance of 
the observed deviation. 
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Figure 3: The — log 10 P values for the data event classes and the expected distribution from 
MC experiments as derived by investigating the ^2 Pt distributions in e + p (a) and e~p (b) 
data. 

The P values observed in the real data in all event classed are compared in figure [3] to 
the distribution of P SM obtained from the large set of MC experiments, normalised to one 
experiment. The comparison is presented for the scans of the ^ Pt distributions. All P val- 
ues range from 0.01 to 0.99, corresponding to event classes where no significant discrepancy 
between data and the SM expectation is observed. These results are in agreement with the 
expectation from MC experiments. The most significant deviation from SM predictions is 
observed in the fM-j-u event class and in e + p collisions with a value of — log 10 P equal to 1.7. 
In the previous HI analysis [5] based on HERA I data and dominated by e + p collisions, the 

b Due to the uncertainties of the SM prediction in the j-j-j-j and j-j-j-j-v event classes at highest M a u 
and 52 Pr (see [2]), where data events are observed, no reliable P values can be calculated for these classes. 
These event classes are not considered to search for deviations from the SM in this extreme kinematic 
domain. 



DIS2007 



largest deviation was also found in this event class, with — log 10 P = 3. 
4 Conclusions 

All the data collected with the HI experiment during HERA II running period (2003-2007) 
have been investigated for deviations from the SM prediction at high transverse momentum. 
All event topologies involving isolated electrons, photons, muons, neutrinos and jets are 
investigated in a single analysis. A good agreement between data and SM expectation is 
found in most event classes. In each event, class the invariant mass and sum of transverse 
momenta distributions of particles have been systematically searched for deviations using 
a statistical algorithm. No significant deviation is observed in the phase-space and in the 
event topologies covered by this analysis. The largest deviation from SM expectation is 
observed in the \i-j-v event class in e + p collisions. 
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